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(57) Abstract 

The present invention provides non-peptidc 
cathepsin D binding compounds and methods 
for using such compounds in the detecdon, 
labelling and inhibition of cathepsin D. In 
formula (I), Ri, R2 and R3 alkyl. substituted 
alkyl. aryl, substituted aiyl. aiylalkyl. substituted 
aiylalkyl, aryloxyalkyl. substituted aiyloxyalkyl. 
heteroaryl. substituted hereroaryl, heteroarylalkyl, 
substituted heteroarylalkyl. heterocycles, substituted 
hcterocycles, heterocyclicalkyl and substittited 
heterocyclicalkyl. Rj and Re are independently 
selected from the group consisting of hydrogen. 

halogen, alkyl, substituted alkyl. aryl, substituted aryl. "aiylalkyl. substituted aiyldkyl. aryloxyalkyl and substituted aryloxyalkyl. or R^ and 
R* and the carbons to which they are bound join to form an optionally substimied 9- or lO-ring atom carbocyclic or heterocyclic fused 
ring system. 
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NANOMOLAR, NON-PEPTWE INHIBITORS OF CATBEPSIN D 

GOVERNMENT RIGHTS 
This invention was made with Government suppoit under Giant (Contract) 
Nos. ROl GMS3696 and ROl GMS0353 awarded by the National Institutes of Health. 
Hie Government has certain rights in this invention. 

FIELD OF THE INVENTION 

The present invention relates generally to substances which bind to and 
inhibit cathepsin D and to the use of these substances in various analytical, diagnostic 
and therapeutic methods based on this binding capability. 

BACKGROUND OF THE INVENTION 
A cherished goal of chemists is to design and synthesize compounds with a 
specific set of properties. This goal is particularly urgent in biological and medicinal 
chemistry as a part of the drug discovery process. Two powerful new tools in this effort 
are structure-based design (I. D. Kuntz, Science 257, 1078-1082 (1992).; I. D. Kuntz, et 
al, Accts. Chem. Res. 27, 117-123 (1994)) and combinatorial chemistry (L. A. 
Thompson, et al, Chem Rev. 96, 555-600 (1996); E. M. Gordon, etal.,J. Med. Chem. 
37, 1385-1401 (1994)). Structure-based design uses information gleaned from 
crystallographic and magnetic resonance e^riments on a target macromolecule, 
frequently an enzyme, to guide the selecdon or design of inhibitors. Computation plays 
a major role in this endeavor (I. D. Kuntz, a al, Accts. Chem. Res. 27, 117-123 
(1994); N. C. Cohen, ei al, J. Med. Chem. 33, 883-894 (1990)). Combinatorial 
chemistry is based on general chemical transformations that allow different building 
blocks to be combined in high yield. These transformations can be performed in parallel 
to synthesize libraries of related compounds n^idly and efficiently (L. A. Thompson, et 
Ql., Chem Rev. 96, 555-600 (1996); E. M. Gordon, et al, J. Med. Chem. 37, 
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1385-1401 (1994)). Nonetheless, the discovery of a new lead compound or the 
improvement of the properties f an existing lead are still draianding tasks. 

Combinatorial approaches to ligand identification initially focused on 
biqpolymer Ubraries prepared by either chemical or biological methods (M. A. Gallop, et 
al.,J. Med. Chem. 37, 1233-1251 (1994)). For these Ubraries, aD possible combinations 
of the buildmg blocks are typically used since there are only four natural nucleotide 
building blocks for aptamer libiaries and 20 proteinogenic amino acid building blocks for 
peptide libraries. Both the structures of the compounds and the theoretical number of 
compounds in the library are determined by setting the length of the biopolymer chain. 
Recendy, considerable efforts have been directed toward the prq)aration of libraries of 
compounds that encompass a wider spectrum of chemical transformations, leading to a 
broader range of properties than found in pq)tides or oligonucleotides (L. A. Thompson, 
et al, Chem Rev. 96, 555-600 (1996); E. M. Gordon, etal.,J. Med. Chem. 37, 
1385-1401 (1994)). These new approaches introduce significant challenges into library 
design. 

A crucial element of any library design is the procedure for selecting 
which compounds to synthesize. This includes the choice of the scaffold, the basic 
reactions and the nature of the building blocks. If the building blocks are leadUy 
available components such as amines, aldehydes or cartx)xylic acids, the number of 
potential compounds to be considered can be quite large. For example, combining three 
building blocks with thousands of components at each position leads to over 1 billion 
compounds. While different strategies have distinct practical limits, typically a 
researcher is prepared to synthesize only thousands of spatially separate compounds and 
tens of millions of compounds in mixtures. Furthermore, evaluation and deconvolution 
of a very large library become rate-limiting activities (N. K. Terrett, et al. , Bioorg. 
Med. Chem. Lett. 5, 917-922 (1995)). Thus, there would be significant advantages to a 
method of reducing the synthetic effort to a small subset of compounds biased towards 
the desired properties. 

How can the potential choices be efficiently reduced? The standard 
strategies are diversity selection and directed selection. Diversity approaches attempt to 
maximize the sampling of chemical and biological properties given a fixed number of 
compounds (R. J. Simon, etaL, Proc. Natl Acad. Sci. U.S.A. 89, 9367-9371 (1992)). 
In directed libraries the size and often the diversity of the library is reduced by selecting 
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those buUding blocks that are predicted to have favorable intenu:tions with the target, or 
by elimin a tin g candidates that are a priori believed to have unfavorable interactions. A 
directed libraiy can be based on substrate preferences, information about known 
inhibitors or, on an assessment of the potential interaction of specific functional groups 
with the target. Both diverse and directed strategies pennit a multistage attack with 
second libraries generated from active compounds found in the first round. 

The develc5)ment of general and efficient ^roaches to identify small, 
non-peptidic inhibitors of aspartic proteases continues to be of interest because of their 
important roles in therapeutically relevant processes (K. Takahashi, Ed., Aspartic 
Proteinases Structure, Function, Biology, and Biomedical Implications (Plenum Press, 
New York, 1995); J. Adams, et al, Ann. Rep. Med. Chem. 31, 279-288 (1996); J. J. 
Edmunds, et aL.Ann. Rep, Med. Chem. 31. 51-60 (1996); D. K. Miller, Ann. Rep. 
Med. Chem. 31, 249-268 (1996)). Aspartic acid proteases are a widely distributed 
family of enzymes that play important roles in fungi, plants, vertebrates and retroviruses. 
The aspartic acid proteases (characterized by having two aspartic acid residues in the 
active site) catalyze the hydrolysis of amide bonds with specificity for peptide bonds 
located between large hydrophobic residues. A number of aspartic acid proteases are 
important pharmaceutical targets, including renin, cathepsin D, the human 
immunodeficiency vims (HIV) proteases, human t-cell leukemia virus type 1 (HTLV-1) 
protease and Candida albicans aspartic acid protease. 

Potent inhibitors of these enzymes can be readily accessed by the 
incorporation of an isosterc that mimics the geometry of the tetrahedral intermediate in 
place of the scissile bond of the peptide substrate. Unfortunately, these inhibitors have 
limited therapeutic utility due to the poor oral availability and/or short circulating half- 
lives that result from their peptidic nature. For this reason, it would be advantageous if 
structure-based design and combinatorial chemistry techniques could be used to develop 
non-pcptide inhibitors of aspartic acid proteases. 

SUMMARY OF THE INVENTION 
Cathepsin D is a lysosomal enzyme that plays an important role in protem 
metabolism (Helseth, et al, Proc. Natl Acad. Sci. USA 81, 3302-3306 (1984)), 
catabolism (Kay, et al, Imracellular Protein Catabotism (eds. Katunuma, et al.), pp. 
155-162 (1989)), and antigen processing (Guagliardi, et al, Namre, 343, 133-139 
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(1990); Van Nooit, et al.. J. Biol Chem., 264, 14159.14164 (1989)). The present 
invention relates to non-pq)tide cathepsin D-binding compounds and to various uses of 
these compounds, both therapeutic and diagnostic, based on their cathepsin D-binding 
propeities. These methods include the use of the compoimds for detecting and 
quantitating the presence of cathepsin D in a biological sample for analytical or 
diagnostic puiposes, and the use of the compounds for inhibiting the ability of cathq)sin 
D to process proteins in living cells. 

In one embodiment, the present invention provides compounds that are 
useful as cathepsin D-binding compounds. Such compounds do not incoiporate any 
amino acids and generally have molecular weights of less than about 700-800 daltons. 
Moreover, these compounds have been found to be potent, non-peptide inhibitors of 
cathepsin D. Compounds falling within the scope of the present invention have the 
general structure: 




Formula I 

In Fonnula I, Rj, and R3 arc members independently selected from the group 
consisting of alkyl, substituted alkyl, aiyl, substituted aryl, arylalkyl, substituted 
arylalkyl, aryloxyalkyl, substituted aryloxyalkyl, heteroaiyl, substituted heteroaryl, 
heteroarylalkyl, substituted heteroarylalkyl, heterocycles, substituted heterocycles, 
heterocyclicalkyl and substituted heterocyclicalkyl. 

In Formula I, R, and R^ arc independentty selected from the group 
consisting of hydrogen, halogen, alkyl, substituted alkyl, aryl, substituted aiyl, arylalkyl, 
substituted arylalkyl, aryloxyalkyl and substituted aryloxyalkyl. In an alternative 
embodiment, R^ and R^ and the carbons to which they are bound, join to form an 
optionally substituted 9- or 10-ring atom carbocyclic or heterocycKc fused ring system. 
Typical 9- or 10-atom fused ring systems include, but are not limited to, napthalyl, 1,3- 
benzodioxolyl, 2,3-benzofuranyl, 1,4-benzodioxanyl, benzimidazoyl, benzothiazolyl ere. 
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Within the scope of the above Fonnula I, certain embodiments are 
preferred. In Fonnula I, one preferred embodiment is that in which R, is a functional 
group including, but not limited to, heteroaiylalkyl and substituted arylalkyl. Examples 
of such functional groups include, but are not limited to, the following: 



a 




Another preferred embodiment is that in which Rj is a functional group including, but 
not limited to, heteroarylalkyl, substituted arylalkyl and substituted aryloxyalkyl. 
Examples of such functional groups include, but are not limited to, the following: 




Also preferred is the embodiment in which R3 is a functional group including, but not 
limited to, substituted aryl, heteroaiylalkyl and substituted aryloxyalkyl. Examples of 
such functional groups include, but arc not limited to, the following: 
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Another preferred embodiment is when and R* and the carbons to 
which they are bound join to form an optionally substituted napthalene ring. In other 
preferred embodiments, R3 and R^ are both hydrogen or R5 is hydrogen and R^ is a meta 
5 or para substituent. 

By virtue of their ability to bind cathepsin D, the compounds of the 
present invention are useful for a variety of purposes. For those compounds in which the 
binding involves the formation of a non-covalent bond, the result is a complex which 
serves as a labelled form of the protease. The label may be the increase in molecular 

10 weight which results from the non-covalent attachment of the cathq)sin D-binding 

compound. Alternatively, the label may be a signal-generating moiety attached to or 
integrated into the structure of the cathepsin D-binding compound. Examples of such 
moieties are enzymes, fluorophores, chemophores, high-affinity groups and radioactive 
(isotopically labeled) atoms. A single complex may contain a single label or multiple 

15 labels of either the same or different types. Labelling in accordance with this invention 
may be performed on cathepsin D proteases regardless of their environment, in vivo or in 
vitro. Labelling may thus extend to proteases present in tissues and cells. The labelling 
wiD generally be followed by an appropriate detection technique, such as autoradiography 
or any of the wide variety of techniques known to those skilled in the art. 
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As one application of labelling in accordance with this invention, the 
cathq)sin D*binding compounds of this invention can be used as mechanistic probes of 
cathepsin D in tppologic assays of compounds for which the presence and nature of a 
cathq)sin D binding site is to be determined. A topologic assay, for example, will be 
performed by combining the following materials in a reaction vessel: 

(a) a labelled version of one or more of the compounds of the above 
formula whose cathq)sin D binding site is known, 

(b) a cathepsin D protease, and 

(c) a test compound whose cathepsin D binding site is to be 
determined. 

The amount of bindiog of the first cathepsin D-binding compound to cathepsin D is then 
determined and compared with the amount of such bindmg which occurs in the absence 
of the test compound. 

In addition to labelling applications, the cathepsin D-binding compounds 
can be administered for purposes of inhibiting protein processing by cathepsin D, thereby 
preventing such proteases from hydrolyzing a peptide substrate. In particular, the 
inhibition of cathepsin D has a number of important therapeutic applications. Such 
applications include the treatment of cancer, smce elevated levels of cathepsin D in 
tumors, particularly for breast cancer, have been correlated with poor prognosis due to 
cathepsin D mediated proteolytic degradation of the extracellular matrix resulting in 
tumor metastasis. In addition, inhibition of cathepsin D is effective for the treatment of 
Alzheimer's disease since elevated levels of cathepsin D have been identified in cerebral 
spinal fluid in Alzheimer's disease patients, and cathepsin D has been shown to have 
high proteolytic activity against mutant ^-protein precursor implicated in Alzheimer's 
disease. As such, the compounds of the present invention can be used, for example, in 
the treatment of cancer and Alzheimer's disease. 

Other features, objects and advantages of the invention and its preferred 
embodiments will become apparent from the detailed description which follows. 

BRIEF DESCRIPnON OF TEE DRAWINGS 
FIG. 1 iUustrate isostere-based inhibitor design. 
FIG. 2 illustrates components employed to prepare the libraries targeting 
cathepsin D. The same disconnections provide scaffold 2. Isocyanates and sulfonyl 
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chlorides, which can be used to incorporate Rj and R3, provide ureas and sulfonamides, 
respectively. 

FIG. 3 illustrates the used of BUILDERppt in designing the combinatorial 
library: (a) Modeling the Scaffold. Coordinates and P|-Pj conformations of the pq>statin 
inhibitor were used as the starting geometry for hydroxyethylamine scaffold. Methyl 
groups were placed at each of the scaffold's RJ-R4 positions, (b) Scaffold Confonnation. 
A conformational search about the three torsion angles of the scaffold yielded 4 
conformational families. A benzyl sidechain (Bn) was added to each of these families at 
the R4 position, (c) Evaluating library components. The program 
BUILDERppt performed a limited conformational search on all possible components at 
each variable position (R1-R3) on each family, and scored the components by their 
potential interaction with cathepsin D. The top scoring candidates for each family were 
merged. 

FIGS. 4A-4C illustrate the components used to prepare the Directed 
Library. Directed library components are labeled with a letter code. EHA is defined as 
Ri = E; Rj = H; and R3 = A, 

FIGS. 5A-5C illustrates the components used to prepare the Diverse 
Library. Diverse library components are labeled by lower case letter code as for the 
directed library. In FIG. 5 A, the /-butyl ester of R^ = i was used in the coupling 
reaction. In FIG. 5C, the Boc protected amine of R3 = d was used in the coupling 
reaction. These protecting groups are removed during TFArH^O cleavage. 

FIGS. 6A-6C illustrates the components in each of the clusters (see 
Experimental Design) that contained the most active sidechains, R* = E, F; R^ = F, H; 
R^ = A, D. J. Thirty-nine compounds incorporating these sidechains were synthesized 
on resin as described previously, EFD, EHD, FFD, FHD, KFD, KHD, LFD, LHD, 
MFD, MHD, NFD, NHD, OFD, OHD, PFD, PHD, QFD, QHD, RFD, RHD, SFD, 
SHD, TFD, TEID, UFD, UHD, VFD, VHD, EHA, EHJ, EHK, EHL, EHM, EHN, 
EHO, EHP, EHQ, EHR, EHS. The compounds were assayed at 333 nM, 100 nM and 
33 nM in high-throughput screening. The most active compounds were synthesized on 
large scale and the K, values were determined (Table 3). 

FIG. 7 illustrates structural diversity being introduced via Grignard 
addition to solid support-bound oc- alkoxy pyrrolidine amide. 
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FIG. 8 illustrates synthesis of solid phase aspaityl protease inhibitor 

synthesis. 

FIG. 9 illustrates components to generate libiaiy diversity in a 204 
compound library. 

DETAILED DESCRIPTION OF THE INVENTION 
AND PREFERRED EMBODIMENTS 

The present invention relates to (i) non-peptide cathq)sin D binding 
compounds; (ii) methods for binding new and known non-pcptide compounds to cathepsin 
D, (iii) methods for using non-pq)tide cathepsin D-binding compounds to inhibit 
cathepsin D. 

A. Definitions 

The term "independently selected" is used herein to indicate that the three 
R groups, Ue.y R„ Rj and R3, can be identical or different {e.g., R,, Rj and R3 may all 
be substituted alkyls or R, and Rj may be a substituted alkyl and R3 may be an aryl, 
etc.). 

The term "alkyl" is used hereui to refer to a branched or unbranched, 
saturated or unsaturated, monovalent hydrocarbon radical having from 1-12 carbons and 
preferably, from 1-6 caibons. When the alkyl group has from 1-6 carbon atoms, it is 
referred to as a "lower alkyl." Suitable alkyl radicals include, for example, methyl, 
ethyl, n-propyl, i-propyl, 2-propenyl (or allyl), n-butyl, t-butyl, i-butyl (or 2- 
methylpropyl), etc. As used herein, the tenn encompasses "substituted alkyls." 

"Substituted alkyl" refers to alkyl as just described including one or more 
functional groups such as lower alkyl, aryl, acyl, halogen (i.e., alkylhalos, e.g., CF3), 
hydroxy, amino, alkoxy, alkylamino, acylamino, acyloxy, aryloxy, aryloxyalkyl, 
mercapto, both saturated and unsaturated cyclic hydrocarbons, heterocycles and the like. 
These groups may be attached to any carbon of the alkyl moiety. 

The term "aryl" is used herein to refer to an aromatic substituent which 
may be a single aromatic ring or multiple aromatic rings which are fused together, linked 
covalently, or linked to a common group such as a methylene or ethylene moiety. The 
common linking group may also be a carbonyl as in benzophenone. The aromatic ring(s) 
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may include phenyl, napbthyl, biphenyl, diphenylmethyl and benzophenone among 
others. The tetm "aryl" encompasses "aiylalkyl." 

The tenn "aiylalkyl" is used herein to refer to a subset of "aiyl" in which 
the aiyl group is attached to the nucleus shown in Formula 1 by an alkyl group as 
defined herein. 

"Substituted aiyl" refers to aiyl as just described including one or more 
functional groups such as lower alkyl, acyl, halogen, alkylhalos (e.g. CF,), hydroxy, 
amino, alkoxy, alkylamino, acylamino, acyloxy, phenoxy, mereapto and both saturated 
and unsaturated cycUc hydrocarbons which are fiised to the aromatic ring(s), linked 
covalenUy or linked to a common group such as a methylene or ethylene moiety. The 
linking group may also be a carbonyl such as in cyclohexyl phenyl ketone. TTie term 
"substituted aryl" encompasses "substituted aiylalkyl." 

"Substimted aiylalkyl" defines a subset of "substituted aiyl" wherein the 
substitoted aryl group is attached to the nucleus shown in Foimula 1 by an alkyl group as 
defined herein. 

The term "acyl" is used to describe a ketone substituent, — C(0)R, where 
R is alkyl or substituted alkyl, aryl or substituted aryl as defined herein. 

The term "halogen" is used herein to refer to fluorine, bromine, chlorine 
and iodine atoms. 

The terra "hydroxy" is used herein to refer to the group —OH. 

The term "amino" is used to describe primary amines, R— NHj. 

The term "alkoxy" is used herein to refer to the -OR group, where R is a 
lower alkyl, substituted lower alkyl, aiyl, substituted aryl, arylalkyl or substituted 
aiylalkyl wherein the alkyl, aiyl, substituted aiyl, aiylalkyl and substituted aiylalkyl 
groups are as described herein. Suitable alkoxy radicals include, for example, metiioxy, 
etiioxy, phenoxy, substituted phenoxy, benzyloxy, phenethyloxy, t-butoxy, etc. 

The term "alkylamino" denotes secondaiy and teitiaiy amines wherein the 
alkyl groups may be eitiier tiie same or different and may consist of straight or branched, 
saturated or unsaturated hydrocaibons. 

As used herein, the tenn "acylamino" describes substituents of the general 
formula RC(0)NR', wherein R' is a lower alkyl group and R represents tiie nucleus 
shown in Formula 1 or an alkyl group, as defined herein, attached to the nucleus. 
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The tenn "acyloxy" is used herein to describe an organic radical derived 
ftom an oiganic acid by the removal of the acidic hydrogen. Simple acyloxy groups 
include, for example, acetoxy, and higher homologues derived from caiboxylic acids 
such as ethanoic, propanoic, butanoic, etc. The acyloxy moiety may be oriented as 
5 either a forward or reverse ester (i.e. RC(0)OR* or R'OC(0)R, respectively, wherein R 
comprises the portion of the ester attached either directly or through an intermediate 
hydrocait)on chain to the nucleus shown in claim 1). 

As used herein, the term "aiyloxy" denotes aromatic groups which are 
linked to the nucleus shown in Figure 1 directly through an oxygen atom. This term 
10 encompasses "substituted aryloxy" moieties in which the aromatic group is substituted as 
described above for "substituted aiyl." 

As used herein "aryloxyalkyr defines aromatic groups attached, through 
an oxygen atom to an alkyl group, as defined herein. The alkyl group is attached to the 
nucleus shown in Figure 1. The term "aryloxyalkyl" encompasses "substituted 
15 aryloxyalkyr moieties in which the aromatic group is substituted as described for 
"substituted aryl." 

As used herein, the term "mercapto" defines moieties of the general 
structure R— S— R' wherein R and R' are the same or different and are alkyl, aryl or 
heterocyclic as described herein. 
20 The term "saturated cyclic hydrocarbon" denotes groups such as the 

cyclopropyl, cyclobutyl, cyclopentyl, etc., and substituted analogues of these structures. 

The term "unsaturated cyclic hydrocarbon" is used to describe a 
monovalent non-aromatic group with at least one double bond, such as cyclopentene, 
cyclohexene, etc, and substituted analogues thereof. 
25 The term "hcteroaryl" as used herein refers to aromatic rings in which one 

or more carbon atoms of the aromatic ring(s) are substituted by a heteroatom such as 
nitrogen, oxygen or sulfur. Heteroaryl refers to structures which may l>e a single 
aromatic ring, multiple aromatic ring(s), or one or more aromatic rings coupled to one or 
. more non-aromatic ring(s). In structures having multiple rings, the rings can be fused 
30 together, linked covalently, or linked to a common group such as a methylene or 
ethylene moiety. The common linking group may also be a carbonyl as in phenyl 
pyridyl ketone. As used herein, rings such as thiophene, pyridine, isoxazole, 
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phthalimide, pyiazole, indole, fiiran, etc. or benzo-fused analogues of these rings are 
defined by the tenn "heteroaiyL" 

"Heteioaiylalkyr defines a subset of "heteroaiyl" wherein an alkyl group, 
as defined herein, links the heteroaryl group to the nucleus shown in Figure 1. 
5 "Substituted heteroaiyl" refers to heteroaryl as just described wherein the 

heteroaryl nucleus is substituted with one or more functional groups such as lower alkyl, 
acyl, halogen, alkylhalos (e.^. CFj), hydroxy, amino, alkoxy, alkylamino, acylamino, 
acyloxy, mercapto, etc. Thus, substituted analogues of heteroaromatic rings such as 
thiophene, pyridine, isoxazole, phthalimide, pyrazole, indole, fiiran, etc. or benzo-fused 
10 analogues of these rings are defined by the term "substituted heteroaryl." 

"Substituted heteroarylalkyl" refers to a subset of "substituted heteroaiyl" 
as described above in which an alkyl group, as defined herein, links the heteroaryl group 
to the nucleus shown in Figure 1. 

The term "heterocyclic" is used herein to describe a monovalent saturated 
15 or unsaturated non-aromatic group having a single ring or multiple condensed rings ftx)m 
1-12 carbon atoms and ftom 1-4 heteroatoms selected from nitrogen, sulfur or oxygen 
within the ring. Such heterocycles are, for example, tetrahydrofuran, moipholine, 
piperidine, pyrrolidine, etc. 

The term "substituted heterocyclic" as used herein describes a subset of 
20 "heterocyclic" wherein the heterocycle nucleus is substituted with one or more functional 
groups such as lower alkyl, acyl, halogen, alkylhalos (e.g. CF3), hydroxy, amino, 
alkoxy, alkylamino, acylamino, acyloxy, mercapto, etc. 

The term "heterocyclicalkyl" defines a subset of "heterocyclic" wherein an 
alkyl group, as defined herein, links the heterocyclic group to the nucleus shown in 
25 Figure 1. 

The term "optionally substituted napthylene ring" describes a naphthalene 
ring which may be unsubstituted or may be substituted with one or more functional 
groups including lower alkyl, halogen, acyl, hydroxy, amino, alkoxy, alkylamino, 
acylamino, acyloxy or aiyl. 
30 The term "substituted heterocyclicalkyl" defines a subset of "heterocyclic 

alkyl" wherein the heterocyclic nucleus is substituted with one or more functional groups 
such as lower alkyl, acyl, halogen, alkylhalos (e.g. CF3), hydroxy, amino, alkoxy, 
alkylamino, acylamino, acyloxy, mercapto, etc. 
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The term "contacting" is used herein interchangeably with the following: 
combined with, added to, mixed with, passed over, incubated with, flowed over, etc. 
Moreover, the cathq>sin D binding compounds of present invention can be 
"administered" by any conventional method such as, for example, parenteral, oral, 
5 topical and inhalation routes as described herein. 

"An amount sufficient" or "an effective amount" is that amount of a given 
cathepsin D compound which exhibits the binding/inhibitory activity of interest or, which 
provides either a subjective relief of a symptom(s) or an objectively identifiable 
improvement as noted by the clinician or other qualified observer. 

10 

B. Non-peptide Protease Binding Compounds 

The present invention relates to the identification of a number of 
small-molecule compounds which are capable of binding to and inhibiting cathepsm D 
employing a combined combinatorial library {see, e.g., Thompson, et al., Chemical 

15 Reviews, 96, 555-600 (1996)) and structure based design approach (see, e.g., Kuntz, 
LD., Science, 257, 1078-1082 (1992)). The libraries of potential cathepsin D binding 
compounds were based upon the display of functionality about the hydroxyethylamine 
scaffold illustrated in FIG, 1. For the initial libraries, the P, sidechain (R*) was held 
constant as a benzyl substituent based upon X-ray crystallographic data of cathepsin D 

20 complexed with the peptide-based natural product pepstatin as reported by Erickson 

(Baldwm, et al, Proc. Natl Acad. ScL USA, 90, 6796-6800 (1993)). As illustrated in 
FIG. 2, diversity was introduced at three positions: a primary amine introduced the R, 
substituent, and acylating agents serve to introduce the and R3 substituents. Once 
prepared, the libraries were screened to identify compounds capable of binding to and 

25 inhibiting cathepsin D. Thereafter, a second generation library was prepared in an effort 
to further explore variants of the most active compounds. Thus, by combining a 
structure-based design and a combinatorial library approach, non-peptidic compounds 
capable of binding to and inhibiting cathepsin D have now been identified. 

Accordingly, in one embodiment, the present invention provides 

30 compounds having the general formula: 
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H OH R, 

o k ^ o 



Fonnulal 

In Fonnula I, R„ Rj and are members independenay selected from the group 
consisting of alkyl, substituted alkyl, aiyl, substituted aiyl, aiylalkyl, substituted 
aiylalkyl, aryloxyalkyl, substituted aiyloxyalkyl, heteroaryl, substimted heteroaiyl, 
5 heteroaiylalkyl, substituted heteroaiylalkyl, heterocycles, substituted heterocycles, 
heterocyclicalkyl and substituted heterocyclicalkyl. 

In Formula I, R, and R^ are independently selected from the group 
consisting of hydrogen, halogen, alkyl, substituted alkyl, aiyl, substituted aiyl, aiylalkyl, 
substituted aiylalkyl, aiyloxyalkyl and substituted aiyloxyalkyl. In an alternative 

10 embodiment, R* and R* and the carbons to which they are bound, join to form an 

optionally substituted 9- or 10-ring atom caibocyclic or heterocyclic fused ring system. 
Typical 9- or 10-atom ring systems include, but are not limited to, napthalyl, 1,3- 
benzodioxolyl, 2,3-benzofiiranyl, 1,4-benrodioxanyl, benzimidazoyl, benzothiazolyl etc. 
Within the scope of the above Fonnula I, certain embodiments are 

15 preferred. In Fonnula I, one preferred embodiment is that in which R, is a functional 
group including, but not limited to, heteroarylalkyl and substituted arylalkyl. Examples 
of such functional groups include, but are not limited to, the following: 

<:xr-- Jr^.. .jx^ 



Another preferred embodiment is that in which R, is a functional group including, but 
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not limited to, heteroaiylalkyl, substituted aiylalkyl and aiyloxyalkyl. Examples of such 
functional groups include, but are not limited to, the following: 




Also preferred is the embodiment in which Ry is a functional group including, but not 
limited to. substituted aiyl, heteroaiylalkyl and substimted aryloxyalkyl. Examples of 
such functional groups include, but are not limited to, the following: 
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Another preferred embodiment is when and R' and the carbons to 
which they are bound, join to form an optionally substituted napthalene ring. In other 
preferred embodiments, R, and R« are both hydrogen, or R, is hydrogen and R« is a meta 
or para substituent on the benzyl ring. 

In Formula I, the benzyl ring may be rephoed by the substituent R4 (see 
below). In this embodiment, R< can be a member selected from the group consisting of 
alkyl, substituted alkyl, aiyl, substituted aryl, arylalkyl, substituted arylalkyl, 
aryloxyalkyl, substituted aiyloxyalkyl, heteroaryl, substituted heteioaryl, heteroarylalkyl, 
substituted heteroarylalkyl, heterocycles, substituted heterocycles, heterocyclicalkyl and 
substituted heterocyclicalkyl. 
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H OH Ri 

o ^ o 



Table 1 sets forth compounds in accordance with the present invention 
which are particularly preferred. The compounds in this table and throughout this 
specification are referred to by code numbers, which are used for convenience only, and 
are strictly arbitrary for purposes of this invention. 
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Table 1. Exemplar Protease Binding Compounds 



Compound 
Code No. 


Protease Binding Compounds | 
Fonnula 


EAA 




EFA 




EHA 


0 Jo 
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FAA 




FFA 




FHA 

• 


o So 


EHB 





wo 98/33795 



PCT/US98/02199 



EFD 


^ jti^ JC JL JL 


EHD 




EEF 




EHF 


X) ° 



21 



1 FHF 


do 


EFH 




EHH 




FFH 


6 XT Y ^ ° 



PAH 




EFI 








EAJ 


OM. M 
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EEJ 


-X. f 


EGJ 


a 

o o a 


EHJ 




FHJ 
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EHO 




FHO 


So' ° 


EHM 




EHR 
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EHS 




UHD 





The compounds of the present invention can be synthesized in a variety of 
ways, using conventional synthetic chemistry techniques. Typically, the compounds of 
the present invention are prepared according to the reaction scheme set forth in FIG, 2, 
wherein R,, Rj and R3 are as defined above. The use of appropriate organic solvents, 
5 temperature and time conditions for running the reactions are within the level of skill in 
the art. Reactions of this type are generally described by E.K. Kick and J.A. EUman, /. 
Med. Chem. 38, 1427-1430 (1995), the teachings of which are hereby incorporated by 
reference. 

C Binding Methods 
10 In one embodiment, the present invention contemplates using the 

above-named compounds to label cathq)sin D. In one case, catbepsin D can be 
"labelled" by virtue of an increase in molecular weight due to non-covalent binding of 
the compound. The increase in molecular weight can be detected by any sizing 
technique, such as HPLC, SDS-PAGE, and mass spectroscopy. 
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Hie present invention also contemplates labelling methods which involve 
attaching to the compounds or integrating into their structure at least one moiety c^ble 
of being detected, either by signal emission or by specific binding. Moieties such as 
these are generally intended to fecilitate the detection of cathepsin D or of molecules 
bound to cathepsin D. Exan^les of types of moieties useful for this purpose are 
enzymes, fluorophores, high-affinity conjugates, chemophores and radioactive atoms 
(radioisotopes). Examples of enzymes are alkaline phosphatase, /3-galactosidase and 
glucose oxidase. An example of an affinity conjugate system is the biotin-avidin system. 
An example of a fluorophore is fluorescein. An example of a chemophore is luminol. 
Examples of radiolabels are 'H, "C, and Other detection moieties known to and 
used by those of skill in the art can be used in the methods of the present invention. 

As indicated above, single or multiple labels can be present in a single 
complex, with multiple labels being the same or different. In the use of the invention for 
facilitating the detection of cathepsin D, preferred labels are tritium CH) and "C. A 
preferred label for facilitating the detection of molecules bound to the compounds is 
biotin. 

The present invention contemplates using labelled analogs of the 
compounds disclosed herein to label cathepsin D in tissues and cells. This type of 
labelling can be used both diagnostically and prognostically. Quantitation of cathepsin by 
this labelling technique can be performed in many ways known to the art, including 
methods using tritiated analogs of the compounds and autoradiography of treated cells on 
microscope slides. In addition, there are a number of automated detection systems 
described for fluorescent staining that also can be employed. See, for example, Resnick, 
et al., U.S. Pat. Nos. 4,125,828 and 4,207,554, hereby incorporated herein by 
reference. 

the present invention also contemplates the in vitro use of the compounds 
disclosed herein as topologic and mechanistic probes of cathq)sin D. In one 
embodiment, the topologic assay utilizes labeDed compounds whose protease binding 
sites are known. In addition, the known cathepsin D-binding sites of the compounds of 
the invention allow die compounds to be used in the determination of binding sites for 
other (peptide and non-pqitide) compounds. In one embodiment, a compound of this 
invention is used in a competition assay with a second compound whose protease bindmg 
site is to be tested. The compounds of this invention can be added to catiiepsin D 
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together or in any sequential order. Where the compound of this invention is labelled, it 
is preferred that the second compound be added first to allow it to block (if possible) the 
binding site. Similarly, where the second compound is labelled, it is preferred that the 
compound of this invention be added first. 

The present invention also contemplates binding methods to immobilize 
cathepsin D. In one embodiment, the present invention contemplates using a cathepsin 
D-binding compound of the invention that will bind non-covalently to cathq)sin D to 
immobilize this protease on a solid support. Such a method is useful in the purification 
of the protease. 

Binding of the above-described compounds is in part a function of 
solubility. If needed, the solubility of these compounds can be enhanced in aqueous 
solutions by the use of a co-solvent. The preferred co-solvent is dimethylsulfoxide 
(DMSO). The concentration range of DMSO is between 0,1 % and 10%, with a 
prefened range of between 0.5% and 5%, 

Z). Cathepsin D Inhibition 

The compounds of the present invention have been found to be potent 
inhibitors of cathq)sin D. As such, the present invention contemplates using the 
compounds of the present invention to inhibit cathepsin D, either in vivo or in vitro. In 
one embodiment, the present invention provides a method of inhibiting cathq>sin D, the 
method comprising contacting cathepsin D with a compound of the formula: 




Fonnula I 

In the above formula, Ri, and R3 are members independently selected from the group 
consisting of alkyl, substituted alkyl, aryl, substituted aryl, arylalkyl, substituted 
arylalkyl, aryloxyalkyl, substituted aryloxyalkyl, heteroaryl, substituted heteroaryl, 
heteroarylalkyl, substituted heteroarylalkyl, heterocycles, substituted heterocycles, 
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heterocyclicalkyl and substituted heterocyclicalkyl. The prior discussions pertaining to 
Rt, R3 and R3 and their preferred embodiments are fiiUy applicable to the catbepsm D 
inhibitors used in this method of the present invention and, thus, wUl not be TcpeaXcd 
with respect to this particular method. R5 and R« are as defined above. 

In another embodimdit, the present invration provides a method of 
inhibiting protein processmg by cathepsin D in living cells, the method comprising 
contacting the cells with an effective amount of a compound of the formula 

H OH R, 



Fonnula I 

The prior discussions pertaining to Rj, Rj R3 R^ and R^ and their preferred embodunents 
arc fully applicable to the cathepsin D inhibitors used in this method of the present 
invention and, thus, will not be repeated with respect to this particular method. 

Compounds capable of inhibiting cathepsin D can readily be identified 
using the assays described herein which measure a change in the hydrolysis of a peptide 
substrate. More particularly, a fluorometric high through-put assay for activity toward 
human liver cathepsin D (Calbiochem) can be used to screen the compounds of the 
present invention for their ability to inhibit cathepsin D. This assay was previously 
described by G. A. Kraft, et al. Methods Enzymol 241, 70-86 (1994), the teachings of 
which are incorporated herein by reference. Moreover, the.peptide substrate (Ac-Glu- 
Glu(Edans)-Lys-Pn>-ne-Cys-Phe-Phe-Arg-Leu-Gly-Lys(Methyl Red).Glu.NH2) used in 
the assay has been previously reported (K„ = 6 /iM) (E. T. Baldwin, et al., Proc. Natl 
Acad. Sci., U.S.A. 90, 6796-6800 (1993)). Generally, the reactants are nuxed, the 
reaction is allowed to proceed for a specific period of time and the fluorescence of the 
reaction products is monitored to determine the extent to which the peptide substrate has 
been cleaved. Compounds found to exhibit inhibitory activity towards cathepsin D using 
the foregoing assay can be synthesized on a larger scale and a more detailed kinetic 
analaysis can be carried out using an assay similar to that set forth in Table 3, infra, and 
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described in greater detail by G. A. Kraft, et al., Methods ErtzymoL 241, 70-86 (1994). 
As such, following the methods of the present invention, compounds can be readily 
synthesized and screened to identify compounds that inhibit cathepsin D. 

As explained above, cathepsin D is a lysosomal enzyme that plays an 
5 important role in protein metabolism, catabolism and antigen pixKessing. As a result of 
their ability to inhibit cathepsin D, the compounds of the present invention can be used 
for a number of ther^)eutic applications. Such plications include the treatment of 
cancer, since elevated levels of cathepsin D in tumon, particularly for breast cancer, 
have been correlated with poor prognosis due to cathq)sin D mediated proteolytic 
10 degradation of the extracellular matrix resulting in tumor metastasis (see, e.g., B. R. 
Westley, et al , Eur. J. Cancer 32, 15-24 (1996)). 

As such, the present invention provides a method for inhibiting the growth 
of a tumor cell, the method comprising contacting the tumor cell with a compound 
having the fonnula: 




Fonnula I 

15 

The prior discussions pertaining to Ri, Rj, R3 Rj and and their preferred 
embodiments are fully applicable to the cathepsin D inhibitors used in this method of the 
present invention and, thus, will not be repeated with respect to this particular method. 
In a presently preferred embodiment, the compounds of the present 

20 invention are used to inhibit the growth of a tumor cell in a mammalian subject, the 
method comprising administering to the mammalian subject a therapeutically effective 
amount of a compound of the present invention. In accordance with this method, 
nuunmalian subjects include, but are not limited to, humans, laboratory animals, 
domestic pets and farm animals. Moreover, tumor cells include, but are not limited to, 

25 lung, colon, breast, ovarian, prostate and hepatic tumor cells. In a presently preferred 
embodiment, the tumor cells are breast tumor cells. 
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In addition to the foregoing, inhibition of cathq)sin D is effective for the 
treatment of Alzheimer's disease since elevated levels of catbepsin D have been identified 
in cerebral spinal fluid in Alzheimer's disease patients, and cathepsin D has been shown 
to have high proteolytic activity against mutant /3-protein precursor implicated in 
Alzheimer's disease (see, e.g., Ladror, U. S., et al., J. Biol Chem. 269, 18422-18428 
(1994); Cataldo, A.M., etal., J. NeuroscL 16, 186-199 (1996)). 

As such, the present invention provides a method of inhibiting the 
proteolysis of a mutant j3-protein precursor in a patient afflicted with Alzheimer's 
disease, the method comprising administering to the patient a cathepsin D inhibitor in an 
amount effective to inhibit the proteolysis of the mutant /3-protein precursor and a 
pharmaceutically acceptable carrier, the cathepsin D inhibitor having the formula: 




Formula I 

The prior discussions pertaining to R,, Rj, R3 R3 and R^d their preferred embodiments 
are fully applicable to the cathepsin D inhibitors used in this method of the present 
invention and, thus, will not be repeated with respect to this particular method. 

The compounds, i.e. , aspartic protease inhibitors, of this invention can be 
incorporated into a variety of formulations for therapeutic administration. More 
particularly, the compounds of the present invention can be formulated into 
pharmaceutical compositions by combination with appropriate, phannaceuticaUy 
acceptable carrien or diluents, and may be formulated into prq)arations in solid, 
semi-solid, liquid or gaseous forms, such as tablets, capsules, powders, granules, 
ointments, solutions, suppositories, injections, inhalants and aerosols. As such, 
administration of the compounds can be achieved in various ways, including oral, buccal, 
rectal, parenteral, intraperitoneal, intradermal, transdermal, intracheal, etc. , 
administration. Suitable formulations for use in the present invention are found in 
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Remington's Pharmaceutical Sciences (Mack Publishing Company, Philadelphia, PA, 
17th ed. (1985)), which is incoiporated herein by reference. In addition, for a brief 
review of methods for drag delivery, see, Langer, Science 249:1527-1533 (1990), which 
is incoiporated herein by reference. 
5 The compounds of the present invention can be administered alone, in 

combination with each other, or they can be used in combination with other known 
compounds (e.g. , other protease inhibitors). In pharmaceutical dosage forms, the 
compounds may be administered in the form of their pharmaceutically acceptable salts, 
or they may also be used alone or in appropriate association, as well as in combination 

10 with other pharmaceutically active compounds. The following methods and excipients 
are merely exemplary and are in no way limiting. It should be noted that since the 
compounds of the present invention are non-peptidic in nature, they tend to have better 
pharmacokinetic propeities {e.g., better oral availability and increased circulating half- 
lives) than compounds that are peptidic in nature. 

15 For oral preparations, the compounds can be used alone or in combination 

with appropriate additives to make tablets, powders, granules or capsules, for example, 
with conventional additives, such as lactose, mannitol, com starch or potato starch; with 
binders, such as crystalline cellulose, cellulose derivatives, acacia, com starch or 
gelatins; with disintegraton, such as com starch, potato starch or sodium 

20 caiboxymethylcellulose; with lubricants, such as talc or magnesium stearate; and if 

desired, with diluents, buffering agents, moistening agents, preservatives and flavoring 
agents. 

The compounds can be fomiulated into preparations for injections by 
dissolving, suspending or emulsilying them in an aqueous or nonaqueous solvent, such as 
25 vegetable or other siniilar oils, synthetic aliphatic acid glycerides, esters of higher 

aliphatic acids or propylene glycol; and if desired, with conventional additives such as 
solubilizers, isotonic agents, suspending agents, emulsifying agents, stabilizen and 
preservatives. 

The compounds can be utilized in aerosol formulation to be administered 
30 via inhalation. The compounds of the present invention can be formulated into 

pressurized acceptable propellants such as dichlorodtfluoromethane, propane, nitrogen 
and the like. 
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Fuithennoie, the compounds can be made into suppositories by mixing 
with a variety of bases such as emulsifying bases or water-soluble bases. The compounds 
of the present invention can be administered rectally via a suppository. The suppository 
can include vehicles such as cocoa butter, caibowaxes and polyethylene glycols, which 
melt at body temperature, yet are solidified at room temperature. 

Unit dosage fonns for oral or rectal administration such as syzups, elixin, 
and suspensions may be provided wherein each dosage unit, for example, teaspoonAil, 
tablespoonful, tablet or s»q)positoiy, contains a predetennined amount of the composition 
containing one or more compounds of the present invention. Similarly, unit dosage 
fonns for injection or intravenous administration may comprise the compound of the 
present invention in a composition as a solution in sterile water, normal saline or another 
pharmaceutically accq)table carrier. 

The term "unit dosage form," as used herein, refers to physically discrete 
units suitable as unitary dosages for human and animal subjects, each unit containing a 
predetermined quantity of compounds of the present invention calculated in an amount 
sufficient to produce the desired effert in association with a pharmaceutically acceptable 
diluent, carrier or vehicle. The specifications for the novel unit dosage forms of the 
present invention depend on the particular compound employed and the effect to be 
achieved, and the pharmacodynamics associated with each compound in the host. 

The pharmaceutically accq)table excipients, such as vehicles, adjuvants, 
carriers or diluents, arc readily available to the pubUc. Moreover, pharmaceutically 
acceptable auxiliary substances, such as pH adjusting and buffering agents, tonicity 
adjusting agents, stabilizers, wetting agents and the like, are readily available to the 
public. 

Preferred formulations of the compounds are oral preparations, particulariy 
capsules or tablets containing each from about 10 milligrams up to about 1000 milligrams 
of active ingredient. The compounds are formulated in a variety of physiologically 
compatible matrixes or solvents suitable for ingestion or injection. 

The invention will be described in greater detail by way of specific 
examples. The following examples are offered for illustrative purposes, and are not 
intended to limit the invention in any manner. Hose of skill in the art will readily 
recognize a variety of noncritical parameten which can be changed or modified to yield 
essentially the same results. 
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A. Specific Approach 

One powerful strategy to target an enzyme class is to incoiporate a stable 
5 mimetic or isostere of the transition state or of an intermediate of the enzyme-catalyzed 
reaction (R. A. Wiley, et al., Med. Res. Rev. 13, 327-384 (1993)). The Ubraries for 
potential cathepsin I) inhibitors are based upon the well-known hydroxyethylamine 
isostere {see, FIG. 1). For the initial libraries, the Pj side chain (R^) is held constant as 
the benzyl substituent, based on the X-ray ciystallographic structure of cathepsin D 

10 complexed with the natural peptide inhibitor pq)statin (E. T. Baldwin, et al. , Proc. Natl. 
Acad. ScL, U.S.A. 90, 6796-6800 (1993)), and upon inhibition constants of peptide- 
based inhibitors (R, A. Jupp, et al, Biochem. 7. 265, 871-878 (1990); N, S. Agarwal, 
etc., J. Med. Chem. 29, 2519-2524 (1986)), 

In a pilot study both 5 and J? epimers at the hydroxyl carbon (see, 

15 structures 1 and 2 of ilG. 1) were prepared since both diastereomers have been found in 
potent inhibitors of other aspartic acid proteases (R. A. Wiley, et al, Med. Res. Rev. 
13, 327-384 (1993)). Because inhibition at 1 fM was only found with compounds of 
scaffold 1 in the pilot study, further syntheses of libraries toward cathepsin D used only 
scaffold 1. Computer modeling (see below) predicted that structure 1 (FIG, 1) would 

20 provide the most potent inhibitors. Diversity is introduced in three positions: a primary 
amine for the Rj substituent and acylatmg agents for the Rj and R3 substituents (FIG. 2). 
The optimization of the synthesis sequence was previously reported (E. K. Kick, J. A. 
EUman, /. Med. Chem. 38, 1427-1430 (1995)). 

The library synthesis was designed to use commercially available 

25 compounds for incorporation of the functionality at R„ Ra, and R3. Exhaustive 

combination of available materials would provide a library of over 10 billion compounds. 
To reduce these possibilities in a sensible way, version 93,2 of the Available Chemical 
Directoiy (ACD) from MDL Information Systems (San Leandro, CA) was used to search 
for all amines, caiboxylic acids, sulfonyl chlorides and isocyanates with MW < 275 

30 daltons. Compounds with functionality obviously incompatible with the synthesis were 
eliminated. The resulting list included approximately 700 amines and 19(X) acylating 
agents. However, this list still provided access to more than 1 billion compounds. 
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Clearly, additional selection criteria were required, and a computational screening 
process was turned to in an effort to oihance selection. 



B. Directed U bmrv Desipt 

The structure-based design process began with coordinates for pq)statin in 
a complex with cath^sin D (E. T. Baldwin, et al., Proc. Natl. Acad. Sd., U.S.A. 90, 
6796-6800 (1993)). The scaffold is identical to pcpstatin on tiie Pj-P, side, but differs on 
the B,.-P3. side and cannot form the same hydrogen bonds with the enzyme (FIG. 3A). 
Thus, tile pepstatin positions for tiw P,-Pj side were used and tiie tiiree scaffold torsion 
angles on tiie P,.-P,, side were systemically rotated. Each rotation was followed by 
energy minimization witiiin tiie cafliepsin D active site, using tiie AMBER (S. J. Wdner, 
et al.,J. Am. Chem. Soc. 106. 765-784 (1984)) force field in Sybyl, a molecular 
modeling software package from Tripos Associates (St. Louis, MO). During 
minimization, tiie enzyme was kept rigid, but fiiU flexibility of tiie scaffold was allowed. 
Botii SandR q)imers, structures 1 and 2, were modeled using metiiyl groups for each of 
tiie Rj-R^ groups. The conformational energies of tiie R epimers were generally ca. 2 
kcal higher tiian for S epimers, leading to tiie prediction tiiat tiie S epimers would bind 
more tightiy tiian tiie R epuners. All minimized conformations of S epimers witiiin a 2 
kcal/mol range were collected and clustered into four families based on geometric 
similarity (HG. 3B). A benzyl group was added to each famUy at tiie R^ position. The 
processed list of compounds for tiie ACD was passed tiirough Sybyl to obtain Gasteiger 
and MarsiU partial atomic charges for each component (J. Gasteiger, et al. , Tetrahedron 
Lett 36, 3219 (1980); J. Gasteiger, M. MarsiU, Organ. Magn. Reson. 15, 353 (1981)). 
To reduce tiie computational time for searching tiie components, compounds witii more 
tiian 4 torsional bonds were identified and removed. A new feature of the BUILDER 
molecular modeling program (R. A. Lewis, et al., J. Mol. Graphics 10, 66-78 (1992); 
D. C. Roe, and Kuntz, LD., JCAAO) 9, 269-282 (1995)), called BUILDERopt (D. C. 
Roe, Dissertation, University of California, San Francisco (1995)), was used to position 
each of tiie R„ R,, and R3 components onto tiie scaffold and to perform a full 
confonnational search for tiie torsion angles of tiie substituent at 15 degree increments. 
In order to reduce tiie combinatoric problem, tiie R„ R,. and R, components were 
examined independentiy, but a probability-based clash grid was constructed to identify R, 
and Rj confonnations tiiat might overlap. For example, if an R, confonnation clashed 
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with more than 50% of the components^ that confonnation was discarded. Each 
rotation was then examined for intramolecular clashes with the scaffold and overlap with 
cathepsin D. Each accepted confonnation was rigid-body mimmired (D. A. Gschwend, 
etal.,J. Compt-Aided Drug Design 10, 123-132 (1996)) and scored with a force-field 

5 grid (E* C. Meng, e!aL,J. Comput. Chem. 13, 505-524 (1992)). The total computer 
time required to evaluate all torsion angles for all sidechains attached to four different 
scaffold conformations was 16 hours on a Silicon Graphics Iris R4400. The fifty best 
scoring components for all families were merged for each of the three variable positions, 
and sorted by overall lowest score. Components with cost above $35/gm were removed, 

10 leaving 34, 35, and 41 components at Ri, Rj and Ra, respectively. Each remaining 
component was structuraUy fmgerprinted (Daylight Clustering Toolkit, Daylight 
Chemical Information Systems, Inc., Santa Fe, NM) and hierarchically clustered 
(similarity cutoff = 0.63) (H. C. Romesburg, Cluster Analysis For Researchers 
(Lifetime Learning Publications, Belmont, CA, 1984)) using the Tanimoto similarity 

15 metric (P. Willett, Similarity and Clustering in Chemical Information Systems (John 

Wiley & Sons, New York, NY, 1987); P. Willett, et a/., J. Chem. If. Comput. Sci. 26, 
109-118 (1986)). For Ri, Rj, and R3, the ten best scoring components from unique 
clusters were selected for the directed library. 

20 C. Diverse Library Desi^ 

A diverse library, which was set at the same size as the direaed library, 
was prepared to provide a "hit" rate when structure-based methods were not employed. 
The diverse library was designed to maximize the variety of functional groups and 
structural motifs of the library components. The sidechains for this library were selected 

25 by clustering the original list of components based on their similarity to each other. 

Components were clustered with the Jarvis-Patrick algorithm (R. A. Jarvis, et al, IEEE 
Comput C22, 1025-1034 (1973)) using the Daylight connectivity measure of similarity 
(Daylight Clustering Toolkit, Daylight Chemical Information Systems, Inc., Santa Fe, 
NM) and a bmary Tanimoto metric (P. Willett, Similarity and Clustering in Chemical 

30 Information Systems (John Wiley & Sons, New York, NY, 1987); P. Willett, et al, /. 
Chmt. If Comput. Sci. 26, 109-118 (1986)). In the Jarvis-Patrick method, two 
compounds are placed in the same cluster if they: 1) are neighbors of one another, and 
2) share at least p neighbors tom a list of q nearest neighbors, where p and q are 
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adjustable parameters. The compound nearest the cluster centroid was chosen as the 
cluster r^resentative. 

The Ri (amine) components were clustered directly as the piimaiy amines. 
The R3 and acylating agents were each attached to a portion of the scaffold before 
5 clustering to yield the proper chemical context at the linkage site. The first round of 
clustering yielded 47, 154, and 162 clusters using p/q = 4/11, p/q = 4/12, and p/q = 
4/12 for Rj, R2, and R3, respectively. The representative Rj and R3 components were 
clustered a second time (p/q = 4/7 for Rj and p/q = 4/7 for R3), resulting in 23 Rj and 
35 R3 components. It is noted that it is not practical to condense a large number of 

10 compounds into an arbitrarily small number of clusters because the cluster membership 
can become very diverse. Final selection of ten compounds from each list was based 
upon: size, cost, availability and synthetic feasibility. Additionally, a balance of 
functional groups for each set of sidechains was sought. A comparison of the directed 
and diverse libraries (FIGS. 4 and 5) shows the much greater range of functionality 

15 spanned in the diverse library, 

D, Library Synthesis and Screening 

The directed and diverse libraries (1000 compounds each) were prepared 
using diastereomer 1 of the hydroxyethylamine scaffold with the components used in 
20 library syntheses shown in PIGS. 4 and 5, respectively. Because the pilot study with R 
and S epimers only showed activity at 1 fiM inhibitor concentration for the S epimers, 
only the 5 epimers of the directed and diverse library were synthesized. AU libraries 
were synthesized in a spatially separate format, and were screened in a high-throughput 
fluorometric assay for inhibitory activity against cathepsin D (G. A. Krafft, et aL, 
25 Methods Enzymol 241, 70-86 (1994)) 

i. Ubmry Synthesis 

The optimization of the solid-phase synthesis sequence to prepare the 
hydroxyethylamine inhibitors and the demonstration of reaction generality was previously 
30 reported by E. K. Kick and J. A. EUman (7. Med. Chem. 38, 1427-1430 (1995)). 

Further testing was performed to establish that the different functionality to be displayed 
at Ri, Rj and R3 would be successfully incorporated into the potential inhibitors. First, 
all the amines and acylating agents to be incorporated in both the diverse and directed 
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libraries were treated with trifluoroacetic acid for 2 h at room temperature to ensure 
stability to the support-cleavage conditions, by far the harshest reaction conditions in the 
synthesis sequence. Second, components that might pose difficulties on chemical or 
steric grounds were evaluated by trial syntheses. Five amines and four carboxylic acids 
5 that did not provide the expected final compound in high yields or purity were discarxied. 
The following amines and acylating agents were successfully tested in the synthesis 
sequence: R, = B, C, E, F, a, e, h, i, j; = B, C, D, E, H, a, e. f; R9 = A, D E H, 
^9 e, g, h, i (FIGS. 4 and 5). The remaining components were assumed to be 
compatible with the synthesis sequence. 

10 The library synthesis was performed on polystyrene beads (20-40 mesh). 

The library was synthesized in a spatially sq)arate array using a 96-well filter q>paratus. 
Transfer of the resin to the individual wells was performed using an isopycnic mixture of 
7V,AWimethylformamide (DMF) and l,2-dichlon)ethane. Incorporation of R, was carried 
out using 1 .0 M ftee amine in TV-methylpyrrolidinone (NMP) at 80**C for 36 h, 

15 Incoiporation of Rj was carried out using stock solutions of 0.3 M caiboxylic acid, 0.3 
M benzotriazole-l-yl-oxy-tris-pyrrolidino-phosphonium hexafluorophosphate (PyBOP), 
0.3 M T-aza-l-hydroxybenzotriazole (HOAt), and 0.9 M /Pi2EtN in NMP overnight. 
The coupling reactions were perfonned twice to ensure tliat complete coupling had 
occurred. After azide reduction with SnClj, PhSH and EtjN, incoiporation of R3 was 

20 carried out as reported above for Rj. Carboxylic acid R2 = E was coupled using 0-(7- 
azabenzotriazol-l-yl)-! , 1 ,3,3-tetramethyl-uromum hexafluorophosphate (HATU) instead 
PyBOP due to formation of a precipitate under the standard coupling procedure. The 
isocyanate Rj = b was coupled at 0.3 M in NMP overnight, and the sulfonyl chlorides 
Rj = e and R3 - c were coupled at 0.3 M in NMP that was 0.9 M in zPrjEtN. 

25 Cleavage of the material from the support was achieved by subjecting the resin to 95:5 
trifluoroacetic acid: HjO for 30 min. The cleavage mixture was removed ftom the resin 
via filtration, foUowed by rinsing the resin and concentration of the filtrates using a 
Jouan 10.10 centrifugation concentrator. Toluene was added to form an azeotrope with 
trifluoroacetic acid during the concentration step. After concentration, the libraries were 

30 stored at -20* C. 

Compounds from each library, picked by random number generation, were 
analyzed by mass spectrometry in a matrix of a-cyano cinnamic acid on a Perseptive 
Biosystems MALDI instrument For the diverse library the expected molecular ion 
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peaks were observed for 46 of 49 compounds (poor ionization was obtained for the other 
three). Molecular ion peaks weie obtained for 44 of 49 compounds from the directed 
library. In addition, the synthesis has been validated by the reasonable correlation of the 
q^roximate IC50 values of the crude material from the libraries with purified material 
that was synthesized on large scale for a number of compounds (see. Table 3, infra). 

2. Screening of the Libraries for Compounds Having Inhibitory 
Activity Against Cathepsin D 

Briefly, a fluorometric high through-put assay for activity toward human 
liver cathepsin D (Calbiochem) was performed in 96-well microtiter plates (G. A. Krafft, 
et al. Methods Enzymal. 241, 70-86 (1994)). The peptide substrate (Ac-Glu- 
Glu(Edans)-Lys-Pro-Ile-Cys-Phe-Phe-Arg-Leu-Gly-Lys(Methyl Red)-Glu-NH2) used in 
the assay has been previously reported (K^, = 6 fM) (E. T. Baldwin, er a/., Proc. Natl. 
Acad. 5a., U.S.A. 90. 6796-6800 (1993)). The assay was performed in DYNATECH 
Microfluor fluorescence microtiter plates, and readings were taken on a Perkin-Elmer 
LS-50B with an attached 96-well plate reader. The excitation wavelength was 340 nm. 
A 340 nm interference filter (Hoya, U-340) for excitation and a 430 nm cut-off filter for 
emission were used. For the microtiter-based assays, the substrate concentration was 5 
fiM and the cathepsin D concentration was 9 nM in a 0.1 M formic acid buffer (pH = 
3.7). DMSO (10%) was used to ensure complete dissolution of the inhibitors. The 
fluorescent unit readings were taken at three time points within the linear region of the 
substrate cleavage, and percent activity of the enzyme was determined by comparing the 
change of fluorescent units (FU) for each well to the average change in FU for six 
control wells without inhibitor. Each library was screened at approximately 1 ^iM 
inhibitor with the concentration based on the assumption that 50% of the thwretical yield 
was obtained for each inhibitor. All wells that showed <50% cathq)sin D activity were 
screened at subsequent three-fold dilutions. All active compounds that showed <60% 
enzyme activity in 1 (jM or lower inhibitor concentrations were assayed in duplicate). 

E. Assay Results 

At approximately 1 /iM of crude compound, the directed library yielded 67 
compounds that inhibited cathepsin D activity ^ 50%(G. A. Krafft, et al, Methods 
EnzymoL 241, 70-86 (1994)). Further dilution of 333 nM and 100 nM inhibitor 



wo 98/33795 



PCTAJS98/D2199 



39 



concentrations affoided 23 and 7 compounds, respectively, that inhibited cathepsin D 
activity S 50% {see. Table 2). The data for the diverse libraiy are also in Table 2, 
infra. TTiere are many uncertainties that can influence the results of a high-throughput 
fluorescence assay, including the purity of each compound, the concentration of the 
compounds, and the experimental enon associated with the microtiter fluorescence 
assay. From rqjetitive expMiments, these errors were estimated to be proximately 
30%, e:q)ressed as enzyme activity. 



Table 2. Number of Compounds with ^ 50% Inhibition of Cathepsin D in 
Library Screen' 





library 


[Inhibitor] 


Directed 


Diverse ^ 


100 nM 


7* 


1§ 


330 nM 


23t 


31 


1 mM 


67 


26 


10 /iM 


11/95* 





• Inhibitors of cathepsin D at respective concentrations: *EAA, EFA, EHA, EHD, EHI, 
EHJ, FHA. An additional sue compounds provided 40-50% inhibition of cathq)sin D. 
tEAA, EFA, EHA, FAA, FFA, FHA. EHB, EFD, EHD, EEF, EHF, FHF, EFH, 
EHH, FAH, FFH, EFI, EHI, EAT, EFJ, EGJ, EHJ, FHJ. An additional thirty 
compounds provided 40-50% inhibition of cathepsin D. $One hundred compounds were 
selected by random number generation for testing at 10 fM. Five compounds were 
highly fluorescent at these concentrations, so tiiat accurate assay data could not be 
obtained in these cases. §fbb, Ifba. fbh, fob. Four compounds (fca, fdb, fib, hhb) 
provided 40-50% inhibition of catijepsin D; with tiie experimental error in tiie assay, this 
activity is similar to tiie activity for tiie tiiree Uiat are listed. ffTht diverse library was 
not tested at 10 /iM. 
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In Older to obtain accurate inhibition constants (JQ several of the 
compounds most likely to be potent inhibitors based on the library screening were 
synthesized on a larger scale, purified by chromatography, and characterized by NMR 
and mass spectrometry. Ihe K, values were calculated ftom IC50 detenninations (see. 
Table 3). From the compounds that were fully characterized, one compound was 
obtained from the directed library with a K, below 100 nM, whereas the diverse library 
contained inhibiton that were 3-4 times less potent. 

Table 3. Inhibition Constants for a Number of the Compounds That Are Potent 
Inhibitor^" 



Cpd Code 


Scaffold 




EHD 


1 


73 ± 9 


EHD 


2 


> 5000 


ma 




111 ± 8 


EHA 




131 ± 12 


EFA 




171 ± 25 


FHA 




231 ± 31 


fbb 




356 ± 31 


fdb 




595 ± 66 



15 



20 



25 



30 



35 



40 



' The cathepsin D assay for "hits" from the directed and diverse libraries was 
performed in a quaitz cuvette with a Perkin-Elmer LS-50B spectrometer. The substrate 
concentration was 2.5 nM and the cathepsin D concentration was 10 nM. Inhibition 
constants (ATj) were determined from ICjo values taken from plots of V/Vq versus 
inhibitor concentration, where V, is the velocity in absence of the inhibitor and Vj is the 
velocity with inhibitor. Since the substrate concentration is signiflcantly below k„, the 
IC30 values were converted to by the equation ^ « (ICjo - E/2), where E, = enzyme 
concentration (S. Cha, etai, Biochem. Pharmacol., 24, 2187-2197 (1975)). 

F.(i) Second Generation Uhmrv 

In the design of the directed library, derivatives with a high level of 
structural similarity were selected against by applying a clustering algorithm to the 
highest scoring components (see Directed Library Design). These clusters were le- 
wcamined to explon the important structural elements of the most active compounds from 
the directed library. In particular, a small second generation library from the clusters for 
the R„ R, and positions that provided the most active compounds was synthesized and 
screened (see, FIG. 6). At 1 ^M, 92% of the compounds screened inhibited cathepsin D 
^ 50%, and 18% of the compounds at 100 nM inhibited cathepsin D ^ 50%. 
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Inhibition constants weie determined for selected compounds {see, Table 4), providing 
several potent inhibitors (K^ ^ 15 nM) of cathq)sin D. 



Table 4. Second Generation Assa^ (see, FIG. €f 



Cpd. Code 


Scaffold 


ICjodiM) 


X;(nM) 


EHO 


1 


19 ± 2 


15 


EHO 


2 


>5000 




FHO 




18 ± 2 


14 


EHM 




14 ± 2 


9 


EHR 




20 ± 2 


15 


EHS 




64 ± 6 


59 


UHD 




229 ± 44 


224 



' Assay conditions are reported in Table 3. 

15 

F.(U) Additional Compounds 

Known aspaityl protease inhibitors have both (/?) and (5) stereocenters 
about the hydroxyl group in Fonmula I. Employing a-alkoxy chelation and non-chelation 

20 controUed reductions, the following synthetic strategy demonstrates acyclic 

diastereocontrol on solid support providing access to either desired diastereomer. By 
exploring different functional groups for and and selecting the Rj, Rj, and R3 
substituents providing the most potent Cathepsin D inhibitors, additional low nanomolar 
Cathepsin D inhibitors were discovered. 

25 Structural diversity may be derived through Grignard addition to a solid 

support-bound a-alkoxy pyrrolidine amide 3 {see Figure 7). The source of diversity is 
derived from aromatic and alkyl Grignard reagents. The Grignard reagents that are not 
conmiercially available can be synthesized using activated magnesium turnings, or a 
magnesium anthracene TEIF complex and the corresponding aromatic and alkyl halides. 

30 Grignard reagents are a suitable source to introduce diversity in the P, site of potential 
aspartyl protease inhibitors, since the Sj protease surface tends to be hydrophobic. The 
resulting ketone is reduced using chelation and non-chelation conditions to provide the 
desired diastereomer. After several functional group manq)ulations, known azido- 
nosylate intermediate 2 is derived and carried through the previously rqwrted synthesis 

35 to obtain potential aspartyl protease inhibitor 1 {see E.K. Kick, J.A. EUman, /. Med. 
Chem. 38, 1427-1430 (1995)) {see. Figure 7). 
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The pynolidine amide 4 prepared in 3 steps in an overall 76% yield from 
commerciaDy available methyl (s)-(-)-2,2-dimethyH,3"dioxolane-4-caiboxylate, was 
coupled to benzyloxybenzyl bromide resin 5 using sodium hydride, tetrabutylammonium 
iodide, and catalytic 18-Crown-6 in THF for 2 hours at 45 (see Figure 8). Bromide 
resin 5 was derived from carbon tetrabromide, ti^henylphosphine, and commercially 
available Wang resin. 

Grignard addition in THF at O^C to support-bound pyrrolidine amide 6 
followed by oc-alkoxy chelation controlled reduction of the resulting ketone using zinc 
borohydride in diethyl ether at -20*C afforded secondary alcohol 7 in a 85:15 
diastereomeric mixture with the major diastereomer shown (see Figure 8). A small 
portion of secondary alcohol 7 was cleaved from the support to provide the 
corresponding triol product which was converted to the corresponding triacetate using 
acetic anhydride and DMAP (Dimethyl anuno pyridine). Diastereoselectivity was 
determined from GC analysis of the corresponding triacetates. No over alkylation from 
the Grignard addition was detected for all components used in the library. 

Secondary alcohol 7 was converted to azide 8 through the formation of a 
secondary nosylate using 4-nitrobenzenesulfonyI chloride and 4-pyrrolidinppyridine in 
chloroform foUowed by azide displacement with sodium azide in M^-dimethylformamide 
at SO^'C. The ;?-methoxy trityl protecting group was selectively removed using 1 % /?- 
toluenesulfonic acid in methylene chloride. Nosylation of the primary alcohol with 4- 
nitrobenzenesulfonyl chloride and pyridine in chloroform provided azido-nosylate 9. 

Amine displacement in A^methylpyrrolidinone (NMP) at 80**C followed by 
acylation with the desired caiboxylic acid, benzotriazole-l-yl-oxy-tris-pyrrolidino- 
phosphonium hexafluorophosphate (PyBOP), aza-l-hydroxybenzotriazole (HOAt) or 
isocyanate in NMP afforded intermediate 10 with the Pj, Rj, and sites of diversity in 
place. Reduction of the azide with tin(II) chloride, thiophenol, and triethylamine 
followed by acylation with the Rj caiboxylic acid, PyBOP, and HOAt, and lasdy, 
cleavage from the support using a trifluoroacetic acid:methylene chloride (90:10) mixture 
provided the desired potential aspartyl protease inhibitor la. 

A library of 204 compounds was derived from the components in Figure 
9. The most potent inhibitors of Cathepsin D were synthesized on a laiger scale, 
purified, and biologically assayed to determine K, values as detailed in Table 5. Overall 
yields of these scaled-up inhibitors ranged from 46-48% for the entire 12 step solid-phase 
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synthesis as detennined by the mass balance of desiied product after column 
chromatography puniication. 



Tabk 5. Inhibition constants for selected compounds (KJ 



Inhibitor 



Code Kj 
(P, R, Rj R3) (nM) 



OveraU Yield 
(12 sieps) 



Kbcf 




1.9±0.2 



46% 



Gbcf 2.6±0.2 48% 
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Synthesis Qf inhjbiiors 

Several of the most potent compounds were synthesized on an average of 
115 milligram scale on the . solid support following the aforementioned method. These 
compounds were purified by column chromatography and characterized by NMR and 
elemental analysis. Overall yields of the compounds were based on the entire 12 step 
solid-phase synthesis and determined by the mass balance of desired product after column 
chromatography purification. The characterization data are listed with the corresponding 
compound code. The *H NMR data is rqx)rted for the major amide rotomer of the 
major diastereomer for each compound. 
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Kbcf. (57 mg. 46%) 'H NMR (400 MHz. CDCl,) d 2.65 (m, 2H), 2.88 (apparent t. J 
= 7.7, 2H), 3.01 (apparent t, J = 6.9, 2H), 3.24 (m, IH), 3.47 (m, 2H), 3.83 - 3.96 
(m, 4H), 3.85 (s. 3H), 3.89 (s, 3H), 4.34 (apparent q, J = 8.3, IH), 4.66 (br. s, IH), 
6.71 (d, J = 9.2, IH), 6.84 (dd. J = 1.7. 8.0. IH). 6.93 - 7.00 (m, 5H), 7.05 (m. IH). 
7.05 (s, IH), 7.07 (s. IH), 7.16 (dd, J = 2.1, 8.1, IH), 7.23 - 7.30 (m, 3H), 7.34 (d, J 
= 2.1, IH), 7.71 (dd, J = 3.1, 5.4, 2H). 7.83 (dd, J = 3.1, 5.4, 2H). Anal, calc'd for 
C44H4oNACl:Br,: C, 59.41; H, 4.53; N, 4.72. Pound: C, 59.22; H, 4.76; N, 4.52. 

Gbcf. (48 mg, 48%) »H NMR (400 MHz. CDCl^ d 2.62 (apparent t, J ^ 7.5, 2H), 
2.82 (apparent t. J = 7.6. 2H), 3.18 - 3.25 (m, 3H), 3.40 - 3.47 (m. 2H), 3.57 (s. 3H), 
3.85 (s, 3H), 3.91 - 3.96 (m, 4H), 4.47 (parent q, J = 8.4, IH), 4.76 (br. s, IH), 
6.69 (s, IH). 6.92 (d, J = 8.2. IH), 6.95 (s. IH), 7.04 (dd, J = 2.1, 8.2, IH), 7.29 (d. 
J = 2.1. IH), 7.40 - 7.45 (m, 3H), 7.68 (dd. J = 3.0, 5.5, 2H), 7.71 - 7.80 (m, 6H). 
Anal, calc'd for C4jH3sN30,Cl^r,: C, 59.52; H, 4.52; N. 4.96. Found: C. 59.63; H. 
4.67; N, 4.69. 

Obcf. (55 mg, 48%) 'H NMR (400 MHz. CDCl,) d 2.65 (m. 2H). 2.85 (apparent t. J 
= 7.3, 2H), 3.08 (apparent t. J = 6.7, 2H), 3.23 (m. IH), 3.44 (m, IH), 3.57 (m, IH), 
3.75 (s, 3H). 3.86 (s, 3H), 3.94 (m, 4H), 4.39 (apparent q, J = 8.3, IH), 4.73 (br. s, 
IH), 6.78 (d, J = 9.2, IH), 6.93 (s, IH), 6.97 (s, IH), 7.02 (d, J = 8.2, IH), 7.10 
(dd. J = 2.1. 8.2, IH). 7.30 (d, J = 2.1. IH), 7.36 - 7.42 (m. 5H), 7.51 - 7.54 (m, 
4H), 7.68 (dd. J = 3.0, 5.4, 2H), 7.81 (dd, J = 3.0, 5.4, 2H). Anal, calc'd for 
C4«H4oNACl^r,: C, 60.49; H. 4.62; N, 4.81. Found: C, 60.23; H, 4.86; N, 4.58. 

Qbcf. (55 mg, 46%) 'H NMR (400 MHz, CDCI3) d 2.64 (m, 2H), 2.86 (apparent t, J 
= 7.1, 2H). 2.96 (m, 2H), 3.20 (m, IH), 3.46 (m, IH), 3.54 (m, IH). 3.78 (m, 2H), 
3.82 (s, 3H). 3.86 (s, 3H), 3.91 (m. 2H), 4.31 (apparent q, J = 8.5, IH), 4.73 (br. s, 
IH), 6.73 (d. J = 9.3. IH). 6.85 (s. IH). 6.96 (s. IH). 7.03 (d. J = 8.3. IH), 7.14 (m. 
2H), 7.16 (dd. J = 2.2, 8.3. IH). 7.32 (d. J = 2.2, IH), 7.37 - 7.41 (m, 2H), 7.70 
(dd, J = 3.0, 5.5, 2H), 7.80 (dd, J = 3.0, 5.5, 2H). Anal, calc'd for 
CjiKjiNjOrCl^r,: C, 52.08; H, 4.03; N, 4.79. Found: C. 52.28; H, 4.09; N, 4.60. 
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G. BmMi 

Novel low nanomolar inhibitors of cathepsin D were identified rapidly 
using combinatorial chemistiy coupled with two different computational strategies. The 
diverse and directed libraries together yielded over 90 compounds active at 1 /iM and 26 
active in the submicromolar range. The "hit rate" for activity at 1 /iM is 6-7% for the 
directed library and 2-3% for the diverse library. Even though both the directed and 
diverse libraries are based on the "active** epimer of the scaffold, the results from the 
directed library are clearly superior. At all concentrations ^ 1 pM, there were more 
"hits" in the directed library than the diverse library. The most potent inhibitors from 
the directed library are 3-4 fold better than those in the diverse library. It is clear from 
the results that the number and quality of the active compounds can be increased by 
using relevant information about the target. 

A strength of the structure-based procedure is that it leads directly to 
testable geometrical hypotheses. In this study there are three hypotheses: 1) S q)imers 
are predicted to bind better than the R epimers; 2) there are two energetically reasonable 
scaffold conformations (family l-h2, family 3+4), which place R groups into different 
pockets; 3) all the inhibitors are assumed to bind in approximately the same orientation 
as pepstatin. 

The first hypothesis was directly tested in pilot experiments where no 
inhibitors based upon the R epimer had activity at 1 ;*M. In addition, the R epimer of 
one of the most potent compounds had a Ki no better than 5 fiM while the of the 5 
epimer was 15 nM (see, Table 4). This conclusion and the inhibitor orientations in the 
cathepsin D complex will be examined crystallographically. 

Using the methodology described herein, active compounds can be 
identified and then the activity is optimized. The optimization criteria can include 
improved potency, selectivity, pharmacokinetic properties, or reduced toxicity. Each of 
these issues appears amenable to library design. For example, compounds with five-sbc 
fold improved potencies were rapidly identified by synthesizing and screening a small 
second generation library that explored variants of the most active compounds. 

The success of the directed library in finding potent inhibitors 
demonstrates the power of coupling combinatorial libraries with structure-based design. 
Combinatorial libraries allow a larger area of molecular space to be explored with the 
functionality selected by the structure-based design, removing the need to identify in 
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advance a single "best" target. Similarly, computational methods allow n^id 
examination of extremely large virtual regimes > 10^^ compounds) and focus the 
chemical efforts into productive regimes. 

It is to be understood that the above description is intended to be 
5 illustrative and not restrictive. Many embodiments will be apparent to those of skill in 
the art upon reading the above descrq)tion. The scq>e of the invention should, therefore, 
be determined not with reference to the above description, but should instead be 
determined with reference to the appended claims, along with the full scope of 
equivalents to which such claims are entitled. The disclosures of all articles and 
10 references, including patent applications and publications, are incoiporated herein by 
reference for all puipose. 
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WHAT IS C l.AlM^n TS! 



1 1. A compound having the fonnula 




Fonnula I 

2 

3 wherein: 

4 Ri, and Rj are members indq)endently selected from the group 

5 consisting of alkyl, substituted alkyl, aiyl, substituted aiyl, 

6 aiylalkyl, substituted aiylalkyl, aiyloxyalkyl, substituted 

7 aryloxyalkyl, heteroaiyl, substituted heteroaryl, heteroaiylalkyl, 

8 substituted heteroarylalkyl, heterocycles, substituted heterocycles, 

9 heterocyclicalkyl and substituted heterocyclicalkyl; 

10 R5 and R« are independently selected from the group consisting of 

11 hydrogen, halogen, alkyl, substituted alkyl, aryl, substituted axyl, 

12 aiylalkyl, substituted arylalkyl, aiyloxyalkyl and substituted 

13 aiyloxyalkyl; or and and the caibons to which they are bound 

14 join to fonn an optionally substituted 9- or 10-ring atom 

15 caibocyclic or heterocyclic, fused ring system. 

1 2. The compound in accordance with claim 1 wherein R, is a 

2 member selected from the group consisting of heteroaiylalkyl and substituted aiylalkyl. 

1 3. The compound in accordance with claim 1 wherein R, is a 

2 member selected from the group consisting of: 




4. Hie compound in accordance with claim 1 wherein is a 
member selected ftom the group consisting of heteroarylalkyl, substituted aiylalkyl and 
aiyloxyalkyl. 

5. The compound in accordance with claim 1 wherein is a 
member selected from the group consisting of: 




6. The compound in accordance with claim 1 wherein Rj is a 
member selected from the group consisting of substituted aryl, heteroarylalkyl and 
aryloxyalkyl. 



?• The compound in accordance with claim 1 wherein R,, R^ and the 
carbons to which they are bound join to form an optionally substituted napthalene ring. 
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8. The compound in accordance with claim 1 wherein and are 

both hydrogen. 

9. The compound in accordance with claim 1 wherein is 
hydrogen and R^ is meta or para substituent selected from the group consisting of 
halogen, alkyl, substituted alkyl, aiyl, substituted aiyl, aiylalkyl, substituted aiylalkyl, 
aiyloxyalkyl and substituted aryloxyalkyl. 

10. The compound in accordance with claim 1 wherein Rj is a 
member selected from the group consisting of: 




11. The compound in accordance with claim 1 wherein said 
compound is selected from the group consisting of 
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13. A methcxl of detecting the presence of cathepsin D in a biological 
sample, said method comprising: 

(a) contacting said biological sample with a compound having the fonnula 



wherein: Rj, Rj and R3 alkyl, substituted alkyl, aryl, substituted aryl, 
arylalkyl, substituted arylalkyl, aryloxyalkyl, substituted 
aryloxyalkyl, heteroaiyl, substituted heteroaryl, heteroarylalkyl, 
substituted heteroarylalkyl, heterocycles, substituted heterocycles, 
heterocyclicalkyl and substituted heterocyclicalkyl, 

R5 and R^ are independently selected from the group consisting of 

hydrogen, halogen, alkyl, substituted alkyl, aryl, substituted aryl, 



arylalkyl, substituted arylalkyl, aryloxyalkyl and substituted 
aiyloxyalkyl; or R^ and R* and the carbons to which they are bound 
join to form an optionally substituted 9- or 10-ring atom 
caibocyclic or heterocyclic fused ring system, 



or a radiolabeled analog thereof; and 

(b) detecting the presence of complexes formed by the binding of said 




Fonnula I 



compound to cathepsin D as a measure of the presence of cathepsin 
in said biological sample. 



14. The method in accordance with claim 13 wherein R, of said 
compound is a member selected from the group consisting of heteroarylalkyl and 
substituted arylalkyl. 
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15. The method in accordance with claim 13 wherein R] of said 
compound is a member selected ^m the group consisting of: 



a 



and 



Me 




CH2- 



16. The method in accordance with claim 13 wherein of said 
compound is a member selected from the group consisting of heteroaiylalkyl, substituted 
arylalkyl and aiyloxyalkyl. 

17. The method in accordance with claim 13 wherein Rj and R« and 
the caibons to which they are bound join to fonn an optionally substituted naphthalene, 
ring. 



18. The method in accordance with claim 13 wherein of said 
compound is a member selected from the group consisting of: 

o 9 




19. The method in accordance with claim 13 wherein Rj of said 
compound is a member selected from the group consisting of: 
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1 20. The method in accordance with claim 13 wherein Rj of said 

2 compound is a member selected from the group consisting of substituted aryl, 

3 heteroarylalkyl and aryloxyalkyl, 

1 21. The method in accordance with claim 13 wherein R5, R< and the 

2 carbons to which they are bound join to fonn an optionally substituted napthalene ring. 

1 22. The method in accordance with claim 13 wherein R5 and are 

2 both hydrogen. 

1 23. The method in accordance with claim 13 wherein R5 is hydrogen 

2 and is a meta or para substituent selected from the group consisting of halogen, alkyl, 

3 substituted alkyl, aryl, substituted aryl, aiylalkyl, substituted arylalkyl, aiyloxyalkyl and 

4 substituted aryloxyalkyl. 

^ 24. A method of inhibiting cathepsin D, said method comprising 

2 contacting cathepsin with a compound having the formula 
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4 wherein: 

5 R3 alkyl, substituted alkyl, aiyl, substituted aiyl, aiylalkyl, 

6 substituted axylallcyl, aryloxyalkyl, substituted aryloxyalkyl, 

7 heteroaiyl, substituted heteroaiyl, heteroaiylalkyl, substituted 

8 heteroaiylalkyl, heterocycles, substituted heteiocycles, 

9 heterocyclicalkyl and substituted heterocyclicalkyl; 

R3 and are independently selected from the group consisting of 

hydrogen, halogen, alkyl, substituted alkyl, aryl, substituted aryl, 
arylalkyl, substituted arylalkyl, aryloxyalkyl and substituted 
aryloxyalkyl; or and R* and the carbons to which they are bound 



join to form an optionally substituted 9- or 10-ring atom 
caibocyclic or heterocyclic fiised ring system. 



1 25. The method in accordance with claim 24 wherein R, of said 

2 compound is a member selected from the group consisting of heteroarylalkyl and 

3 substituted arylalkyl. 

1 26. The method in accordance with claim 24 wherem R, of said 

2 compound is a member selected from the group consisting of: 



a 




3 
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1 27, The method in accordance with claim 24 wherein of said 

2 compound is a member selected from the group consisting of heteroaxylalkyl, substituted 

3 arylalkyl and aiyloxyalkyl. 

1 28. Hie method in accordance with claim 24 wherein of said 

2 compound is a member selected from the group consisting of: 




1 29. The method in accordance with claim 24 wherein R3 of said 

2 compound is a member selected from the group consisting of substituted aryl, 

3 heteroarylalkyl and aryloxyalkyl 



1 
2 



30. The method in accordance with claim 24 wherein R3 of said 
compound is a member selected from the group consisting of: 
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3 

1 31. The method in accordance with claim 24, wherein and R« and 

2 the carbons to which they are bound join to form an optionally substituted napthaiene 

3 ring. 

1 32. The method in accordance with claim 24. wherein R5 and are 

2 both hydrogen. 

1 33. The method in accordance with claim 24, wherein R5 is hydrogen 

2 and R« is a meta or para substituent selected from the group consisting of halogen, alkyl, 

3 substituted alkyl, aryl, substituted aryl, arylalkyl, substituted arylalkyl, aryloxyalkyl and 

4 substituted aryloxyalkyl. 

1 34. A method of inhibiting protein processing by cathq)sin D in living 

2 cells, said method comprising contacting said cells with an effective amount of a 

3 compound having the formula 
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4 



5 wherein: 

6 Ri, Rj and R3 alkyl, substituted alkyl, aryl, substituted aryl, arylalkyl, 

7 substituted aiylalkyi, aiyloxyalkyl, substituted aryloxyalkyl, 

8 heteroaryl, substituted heteroaiyl, heteroarylalkyl, substituted 

9 heteroarylalkyl, heterocycles, substituted heterocycles^ 
10 heterocyclicalkyl and substituted heterocyclicalkyl; 

R5 and R« are independently selected from the group consisting of 

hydrogen, halogen, alkyl, substituted alkyl, aryl, substituted aryl. 



arylalkyl, substituted arylalkyl, aryloxyalkyl and substituted 
aryloxyalkyl; or and R* and the carbons to which they are bound 
join to form an optionally substituted 9- or lO-ring atom 
carbocyclic or heterocyclic fused ring system. 



1 35. The method in accordance with claim 34 wherein R, of said 

2 compound is a member selected from the group consisting of heteroarylalkyl and 

3 substituted arylalkyl. 

1 36. The method in accordance with claim 34 wherein Rj of said 

2 compound is a member selected from the group consisting of: 



a 
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37« The method in accordance with claim 34 wherein of said 
compound is a member selected from the group consisting of heteroarylalkyl, substituted 
arylalkyl and aiyloxyalkyL 

38. The method in accordance with claim 34 wherein of said 
compound is a member selected from the group consisting of: 



a 

a 





39. The method in accordance with claim 34 wherein R3 of said 
compound is a member selected from the group consisting of substituted aryl, 
heteroarylalkyl and aiyloxyalkyL 

40. The method in accordance with claim 34 wherein R5 and R« and 
the carbons to which they are bound join to form an optionally substituted napthalene 
ring. 

41. The method in accordance with claim 34 wherein R5 and R« are 

both hydrogen. 

42. The method in accordance with claim 34 wherein R3 is hydrogen 
and R^ is a meta or para substituent selected from the group consisting of halogen, alkyl, 
substituted alkyl, aryl, substituted aiyl, arylalkyl, substituted aiylalkyl, aiyloxyalkyl and 
substituted aiyloxyalkyL 



43. The method in accordance with claim 34 wherein R3 of said 
compound is a member selected from the group consisting of: 
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1 44. The method in accordance with claim 34 wherein said compound 

2 is selected from the group consisting of 
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1 45. The method in accordance with claim 34 wherein said compound 

2 is selected from the group consisting of 
3 



a 
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46. A phannaceutical composition comprising a phannaceutically 



acceptable excipient and a cathq)sin D inhibitor, said cathepsin D inhibitor having the 
formula 



R„ R2 and R3 alkyl, substituted alkyl, aryl, substituted aryl, arylalkyl, 
substituted arylalkyl, aryloxyalkyl, substituted aryloxyalkyl, 
heteroaryl, substituted heteroaryl, heteroarylalkyl, substituted 
heteroarylalkyl, heterocycles, substituted heterocycles, 
heterocyclicalkyl and substituted heterocyclicalkyl. 

R5 and R« are independently selected from the group consisting of 

hydrogen, halogen, alkyl. substituted alkyl, aryl, substituted aryl, 



arylalkyl, substituted arylalkyl, aryloxyalkyl and substituted 
aryloxyalkyl; or R^ and R** and the carbons to which they are bound 




Fonnula I 



wherein: 



join to form an optionally substituted 9- or 10-ring atom 
carbocyclic or heterocyclic fused ring system. 
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